IDACT: Automating Data Discovery and Compilation
نویسندگان
چکیده
IDACT serves as an automated middle-layer which acts as an interface between the user and the heterogeneous data sources. Data sources are registered with the middle-layer, and a user can then request data from any (or all) of the available sources without having knowledge of the specific implementation details and formats. The data returned will be automatically converted from the native format to the formats required by the scientific models or analysis tools. Finally, the intermediate data and the products of the models may be archived in a data warehouse for re-use in the future. This approach allows the researcher to focus on the analysis of results, while ensuring that both the hardware and the researcher’s time are used as efficiently as possible.
منابع مشابه
IDACT Query Manager for Heterogeneous Dataset Assimilation
There exists a wealth of data in heterogeneous formats related to NASA science missions. The Query Manager (QM) for the IDACT System (funded under AIST02-0135) minimizes the efforts required of the scientific researcher to obtain and format datasets relevant to the scientific research domain. The QM accepts a general topical request from the researcher; this request is analyzed in the context o...
متن کاملDiscovery Informatics in Biological and Biomedical Sciences: Research Challenges and Opportunities
New discoveries in biological, biomedical and health sciences are increasingly being driven by our ability to acquire, share, integrate and analyze, and construct and simulate predictive models of biological systems. While much attention has focused on automating routine aspects of management and analysis of "big data", realizing the full potential of "big data" to accelerate discovery calls fo...
متن کاملA general compilation algorithm to parallelize and optimize counted loops with dynamic data-dependent bounds
We study the parallelizing compilation and loop nest optimization of an important class of programs where counted loops have a dynamically computed, data-dependent upper bound. Such loops are amenable to a wider set of transformations than general while loops with inductively defined termination conditions: for example, the substitution of closed forms for induction variables remains applicable...
متن کاملAutomating the Semantic Annotation of Geodata
The ability to represent geospatial semantics is of great importance when building geospatial applications for the web. It will not only enhance discovery and retrieval of geographic information, but it will also enable its translation and reuse in contexts other than the original one. We propose a method for automating the semantic annotation of geodata based on spatial characteristics and sug...
متن کاملDynamic Compilation for Reducing Energy Consumption of I/O-Intensive Applications
Tera-scale high-performance computing has enabled scientists to tackle very large and computationally challenging scientific problems, making the advancement of scientific discovery at a faster pace. However, as computing scales to levels never seen before, it also becomes extremely data intensive, I/O intensive, and energy consuming. Amongst these, I/O is becoming a major bottleneck, impeding ...
متن کامل